Estimating the Dispersion Parameter of the Negative Binomial Distribution for Analyzing Crash Data Using a Bootstrapped Maximum Likelihood Method

نویسندگان

  • Yunlong Zhang
  • Zhirui Ye
  • Dominique Lord
چکیده

The objective of this study is to improve the estimation of the dispersion parameter of the negative binomial distribution for modeling motor vehicle collisions. The negative binomial distribution is widely used to model count data such as traffic crash data, which often exhibit low sample mean values and small sample sizes. Under such situations, the most commonly used methods for estimating the dispersion parameter, the method of moment and the maximum likelihood estimate, may become inaccurate and unstable. A bootstrapped maximum likelihood estimate is proposed to improve the estimation of the dispersion parameter. The proposed method combines the technique of bootstrap resampling with the maximum likelihood estimation method to obtain better estimates of the dispersion parameter. The performance of the bootstrapped maximum likelihood estimate is compared with the method of moment and the maximum likelihood estimates through Monte Carlo simulations. To validate the simulation results, the methods are applied to observed data collected at 4-legged unsignalized intersections in Toronto, Ont. Overall, the results show that the proposed bootstrap maximum likelihood method produces smaller biases and more stable estimates. The improvements are more pronounced with small samples and low sample means.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Evaluation of estimation methods for parameters of the probability functions in tree diameter distribution modeling

One of the most commonly used statistical models for characterizing the variations of tree diameter at breast height is Weibull distribution. The usual approach for estimating parameters of a statistical model is the maximum likelihood estimation (likelihood method). Usually, this works based on iterative algorithms such as Newton-Raphson. However, the efficiency of the likelihood method is not...

متن کامل

Adjustment for the Maximum Likelihood Estimate of the Negative Binomial Dispersion Parameter

Negative Binomial (or Poisson-gamma) model has been used extensively by highway safety analysts because it can accommodate the over-dispersion, often exhibited in crash data. However, it has been reported in the literature that the maximum likelihood estimate of the dispersion parameter of NB models can be significantly affected when the data are characterized by small sample size and low sampl...

متن کامل

Beta - Binomial and Ordinal Joint Model with Random Effects for Analyzing Mixed Longitudinal Responses

The analysis of discrete mixed responses is an important statistical issue in various sciences. Ordinal and overdispersed binomial variables are discrete. Overdispersed binomial data are a sum of correlated Bernoulli experiments with equal success probabilities. In this paper, a joint model with random effects is proposed for analyzing mixed overdispersed binomial and ordinal longitudinal respo...

متن کامل

Does the Dispersion Parameter of Negative Binomial Models Truly Estimate the Level of Dispersion in Over-dispersed Crash data with a Long Tail?

Despite many statistical models that have been proposed for modeling motor vehicle crashes, the most commonly used statistical tool remains the Negative binomial (NB) model. Crash data collected for safety studies may exhibit over-dispersion and a long tail (i.e., a few sites have unusually high number of crashes). However, some studies have shown that NB models cannot handle over-dispersed cou...

متن کامل

Hurdle, Inflated Poisson and Inflated Negative Binomial Regression Models ‎ for Analysis of Count Data with Extra Zeros

In this paper‎, ‎we ‎propose ‎Hurdle regression models for analysing count responses with extra zeros‎. A method of estimating maximum likelihood is used to estimate model parameters. The application of the proposed model is presented in insurance dataset‎. In this example‎, there are many numbers of claims equal to zero is considered that clarify the application of the model with a zero-inflat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006